Image processing apparatus, image processing method, and program

ABSTRACT

An image processing apparatus includes a calculation unit configured to calculate information indicating similarity among a plurality of tomographic images, and a generation unit configured to generate a tomographic image from the plurality of tomographic images based on the calculated information indicating similarity.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of co-pending U.S. patent application Ser. No. 13/702,705, filed Dec. 7, 2012, which is a U.S. national stage application of International Patent Application No. PCT/JP2011/002995, filed May 30, 2011, which claims the priority benefit of Japanese Patent Application No. 2010-131257, filed Jun. 8, 2010, all of which are hereby incorporated by reference herein in their entirety.

TECHNICAL FIELD

The present invention relates to a technique for the enhancement of image quality of a tomographic image in an eye portion.

BACKGROUND ART

A tomographic imaging apparatus for the eye portion such as an optical coherence tomography (hereinafter referred to as OCT) is usable to three-dimensionally observe a state inside a retinal layer. In recent years, the tomographic imaging apparatus has attracted attention because it is useful in accurately diagnosing disease.

In an ophthalmologic diagnosis, there can be used a volume image for grasping the state of the entire retinal layer and a high image-quality two-dimensional tomographic image for grasping a layer which is not imaged on a low image-quality tomographic image. The volume image refers to aggregation of the two-dimensional tomographic images.

The image quality of the tomographic image acquired by the OCT depends on the strength of near infrared light incident on a retina. For this reason, the strength of the light with which the retina is irradiated needs to be increased to improve the image quality of the tomographic image. However, from the viewpoint of safety, the strength of the light with which the retina can be irradiated is limited. This requires the high image-quality tomographic image to be generated while the retina is being irradiated with near infrared light within the unquestionable strength of the light from the viewpoint of safety. To meet such a requirement, Japanese Patent Application Laid-Open No. 2008-237238 discusses a technique in which two-dimensional captured tomographic images are superposed on each other to generate a noiseless sectional image.

In Japanese Patent Application Laid-Open No. 2008-237238, however, a plurality of tomographic images are merely subjected to arithmetic mean. For this reason, in a case where correlation is low between the added tomographic images, a reduction in diagnosis information can be increased. In particular, since involuntary eye movement occurs in eyes, the entire areas of adjacent images are not always similar.

SUMMARY OF INVENTION

The present invention is directed to an improvement in quality of a tomographic image.

According to an aspect of the present invention, an image processing apparatus includes calculation means configured to calculate information indicating similarity among a plurality of tomographic images, and generation means configured to generate a tomographic image from the plurality of tomographic images based on the calculated information indicating similarity.

Further features and aspects of the present invention will become apparent from the following detailed description of exemplary embodiments with reference to the attached drawings.

BRIEF DESCRIPTION OF DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate exemplary embodiments, features, and aspects of the invention and, together with the description, serve to explain the principles of the invention.

FIG. 1 illustrates a configuration of an image processing system according to a first exemplary embodiment of the present invention.

FIG. 2A is a flow chart illustrating a tomographic image capturing process in an image processing apparatus according to the first exemplary embodiment.

FIG. 2B is a flow chart illustrating the tomographic image capturing process in the image processing apparatus according to the first exemplary embodiment.

FIG. 3A is a diagram for describing superposition image generation processing.

FIG. 3B is a diagram for describing the superposition image generation processing.

FIG. 3C is a diagram for describing the superposition image generation processing.

FIG. 4 is a diagram for describing superposition areas.

FIG. 5A is a diagram for describing superposition determination processing.

FIG. 5B is a diagram for describing the superposition determination processing.

FIG. 5C is a diagram for describing the superposition determination processing.

FIG. 6 is a block diagram illustrating a configuration of an image processing system according to a second exemplary embodiment of the present invention.

FIG. 7A is a flow chart illustrating a tomographic image capturing process in an image processing apparatus according to the second exemplary embodiment.

FIG. 7B is a flow chart illustrating the tomographic image capturing process in the image processing apparatus according to the second exemplary embodiment.

FIG. 8A is a diagram for describing superposition areas.

FIG. 8B is a diagram for describing the superposition areas.

FIG. 8C is a diagram for describing the superposition areas.

DESCRIPTION OF EMBODIMENTS

Various exemplary embodiments, features, and aspects of the invention will be described in detail below with reference to the drawings.

FIG. 1 illustrates an image processing system 100 including an image processing apparatus 110 according to a first exemplary embodiment of the present invention. As illustrated in FIG. 1, the image processing system 100 is configured such that the image processing apparatus 110 is connected to a tomographic imaging apparatus 120 via an interface.

The tomographic imaging apparatus 120 is the one that captures a tomographic image of an eye portion and is comprised of a time-domain OCT or a Fourier-domain OCT, for example. The tomographic imaging apparatus 120 is a known apparatus, so that the detailed description thereof is omitted herein.

The image processing apparatus 110 includes an acquisition unit 111, a storage unit 112, an image processing unit 113, and a display control unit 114.

The acquisition unit 111 acquires a tomographic image captured by the tomographic imaging apparatus 120 and stores the captured tomographic image in the storage unit 112. The image processing unit 113 generates a new two-dimensional tomographic image from the tomographic images stored in the storage unit 112. The display control unit 114 performs control for displaying the processed image on a monitor (not illustrated).

A plurality of places may be sequentially scanned. As an example, however, FIG. 3A illustrates a schematic diagram of two-dimensional tomographic image groups in a macula lutea portion in which the tomographic imaging apparatus 120 continues repetitively scanning and imaging almost the same place. A direction in which scanning is performed with a measuring beam to capture a two-dimensional tomographic image is referred to as a main scanning direction. A direction orthogonal to the main scanning direction is referred to as a sub-scanning direction.

In general, the tomographic imaging apparatus 120 performs imaging in the main scanning direction while shifting the measuring beam in the sub-scanning direction. In other words, the present exemplary embodiment is also adaptable to a case where the measurement light is shifted in the sub-scanning direction.

In FIG. 3A, coordinate axes x and y and time base t are provided. FIG. 3A also illustrates two-dimensional tomographic images T₁ to Tn in the macula lutea portion imaged at different points of time. In other words, the two-dimensional tomographic image group is formed of aggregation of the two-dimensional tomographic images in which almost the same place is captured.

The enhancement of image quality refers to the improvement of S/N ratio. The improvement of image quality refers to the improvement of S/N ratio.

The processing by the image processing apparatus 110 according to the present exemplary embodiment is described below with reference to a flow chart in FIGS. 2A and 2B.

In step S201, a control unit (not illustrated) controls the tomographic imaging apparatus 120 to capture a retinal layer. A position in the direction of depth (a direction z in FIG. 3A) being a direction in which the retina is irradiated with the measuring beam and the direction of plane (a direction x in FIG. 3A) orthogonal to the direction z is adjusted. The alignment of position of direction of depth corresponds to the alignment of position of a coherent gate for acquiring a tomographic image.

In step S201, the control unit (not illustrated) performs adjustment to a position suitable for imaging the retinal layer. In step S202, an imaging instruction unit (not illustrated) issues instructions to start imaging.

In step S203, when an operator issues instructions for imaging, the control unit (not illustrated) repetitively scans almost the same place to capture a plurality of tomographic images.

The control unit (not illustrated) also has a function to adjust the distance of movement in the sub-scanning direction.

In step S204, the image processing unit 113 generates a new two-dimensional tomographic image using the two-dimensional tomographic image group stored in the storage unit 112. The generation of the two-dimensional tomographic image whose image quality is enhanced is described below with reference to FIG. 2B.

In step S210, a first alignment unit 115 aligns the two-dimensional tomographic images with one another. The alignment is performed such that an evaluation function indicating the similarity of two two-dimensional tomographic images, for example, is previously defined and the tomographic image is deformed so that the value of the evaluation function is maximized. A method for performing evaluation by a pixel value (a method for performing evaluation using the correlation coefficient of equation (1), for example), is used as the evaluation function. Processing in which translation or rotation is performed using affine transformation or magnification rate is changed is used as processing for deforming an image. The alignment may be performed on the basis of a feature point. A feature portion such as a retinal layer or a lesion is extracted from the two-dimensional tomographic image, for example. Since an internal limiting membrane, nerve fiber layer, visual cell inner-outer segment junction, and stratum pigmenti retinae are high in luminance and the boundary of the layer is high in contrast, the detection results on the layers and the feature portions are used to perform the alignment.

A deformation parameter is calculated at which the evaluation function is maximized while deforming the two-dimensional tomographic image to align the two-dimensional tomographic images with one another. If the number of two-dimensional tomographic images to be superposed is N, N−1 two-dimensional tomographic images are aligned with the two-dimensional tomographic image serving as a reference.

In step S211, a determination unit 117 determines A-scan images superposed in units of the corresponding A-scan images. This process is described below with reference to FIG. 4. Two-dimensional tomographic images Ti′ and Ti+1′ captured at different points of time are aligned in step S210. Aij′ and A(i+1)j′ represent the corresponding A-scan images in the aligned two-dimensional tomographic images Ti′ and Ti+1′. The A-scan image in the present exemplary embodiment refers to one pixel row parallel to the direction of the z axis in FIG. 4.

The A-scan image is the one that agrees with the direction in which a measuring beam is incident. Each of the A-scan images acquired from the same place has almost the same image information. For this reason, even if the entire two-dimensional tomographic images are low in similarity due to involuntary eye movement, data mutually high in similarity can exist in the A-scan images of different two-dimensional tomographic images.

Image areas Rij′ and R(i+i)j′ each represent an area of +−.alpha. in the direction of the x axis with the A-scan images Aij′ and A(i+1)j′ as a center. A calculation unit 116 calculates a similarity between the A-scan images in the image areas Rij′ and R(i+1)j′ with the A-scan image as a center. In a case where a reference area for determining superposition is taken as the area Rij′, the calculation unit 116 calculates a similarity between the areas corresponding to all of the areas Rij′ to Rnj′. A formula using a correlation coefficient as an evaluation function indicating a similarity between the A-scan images is expressed by an expression (1).

$\begin{matrix} \left\lbrack {{Math}.\mspace{14mu} 1} \right\rbrack & \; \\ \frac{\int{\int_{S}^{\;}{\left( {{f\left( {x,y} \right)} - \overset{\_}{f}} \right)\left( {{g\left( {x,y} \right)} - \overset{\_}{g}} \right)\ {\mathbb{d}x}{\mathbb{d}y}}}}{\sqrt{\int{\int_{S}^{\;}{\left( {{f\left( {x,y} \right)} - \overset{\_}{f}} \right)^{2}\ {\mathbb{d}x}{\mathbb{d}y}{\int{\int_{S}^{\;}{\left( {{g\left( {x,y} \right)} - \overset{\_}{g}} \right)^{2}\ {\mathbb{d}x}{\mathbb{d}y}}}}}}}} & (1) \end{matrix}$

In FIG. 4, where the area Rij′ is taken as f(x, y) and the area R(i+1)j′ is taken as g(x, y), f and g   [Math. 2] represent the averages of areas f(x, y) and g(x, y), respectively.

The determination unit 117 selects an area used for superposition for each area. The processing by the determination unit 117 is described below with reference to FIGS. 5A to 5C. FIGS. 5A to 5C illustrate examples of results obtained by the calculation unit 116 calculating the similarity between areas.

The abscissa represents numbers 1 to N of the captured two-dimensional tomographic images. The ordinate signifies a similarity between a reference area and other areas in the two-dimensional tomographic images. FIG. 5A illustrates a method (a) for setting a threshold Th to select an area whose similarity exceeds a predetermined value. FIG. 5B illustrates a method (b) for selecting top M images of similarity when the areas in FIG. 5A are sorted in descending order of similarity. FIG. 5C illustrates a method (c) for representing rates of change of similarity after the sorting process by a line graph when the areas in FIG. 5A are subjected to a sorting process in descending order of similarity. Here is illustrated an example where images are selected before the rate of change of similarity is significantly decreased. More specifically, the similarity at which the rate of change of similarity after the sorting process shows the predetermined value is obtained to select the area whose value exceeds the obtained similarity.

In addition to the above, there is another method (d) for observing the rate of change of similarity without conducting the sort process. The calculation unit 116 stops calculating an evaluation value if the similarity is lower than a predetermined value. An image in which the evaluation value is higher than the predetermined value and the calculation thereof is still being performed is selected.

For the method (a) using the threshold, the area where the evaluation value is lower is not used for superposition, so that only the areas where the valuation value is high can be superposed.

This is suitable for the case where the movement distance per unit time in the sub-scanning direction is small. This is because the method has an effect in which an organization structure is changed a little but the image of the area which is greatly different in organization structure and is produced by unique movement or blinking of an eye is not selected.

For the method (b) selecting the fixed number M of images, the number of pieces of data subjected to superposition average processing in units of an image is not dispersed. The number of pieces of data subjected to superposition average processing in units of an A-scan image is also not dispersed. The extent that a noise is reduced can be leveled, so that the method (b) is suitable for the case where the image quality is equalized.

The method (c) using a rate of change is characterized in that the areas similar to each other can be selected even in the case where the image quality is inferior as a whole due to illness and the similarity is low as a whole.

The method (d) is suitable also for the case where the movement distance per unit time in the sub-scanning direction is large. This is because the calculation unit 116 can be prevented from unnecessarily calculating an evaluation value due to the reason that a displacement in the sub-scanning direction exceeding the predetermined value makes the organization structure of retina different. In other words, a change in the organization structure can also be recognized by observing the rate of change.

As described above, the area where superposition is performed based on the evaluation value is selected in each area. For this reason, in the case where the retinal layer is deformed due to the involuntary eye movement in the two-dimensional tomographic image or in an area where the image quality is partially lowered due to blink or shading is not used for superposition, the image quality of a newly generated image is enhanced.

The determination unit 117 performs processing of combining the above methods (a) to (d) according to the movement distance per unit time in the sub-scanning direction. For example, in the case where the movement distance is shorter than a predetermined value, the processing of the method (a) or (b) is used. In the case where the movement distance exceeds the predetermined value, the processing of the method (c) or (d) is used. A combination of the methods (a) and (d) can perform the processing attaching importance to speed. A combination of the methods (b) and (d) can perform the processing attaching importance to image quality.

In step S212, a generation unit 118 performs superposition processing. The superposition of two A-scan images is described below. FIG. 3B is a diagram illustrating processing in which the two-dimensional tomographic images are processed in units of the A-scan image to generate one two-dimensional tomographic image as a composite image. In other words, an addition average processing of the A-scan images positioned on different two-dimensional tomographic images captured at different points of time is described as an example. FIG. 3C illustrates a high-quality two-dimensional tomographic image generated by performing the addition average processing using M pixels (two pixels in the example) for each pixel. In FIG. 3C, Aij″ is a new A-scan image calculated by performing the addition average processing of the corresponding A-scan images.

In FIG. 3B, Ti′ and Ti+1′ represent two-dimensional tomographic images in which the same cross section is captured at different points of time. Aij′ and A(i+1)j′ represent A-scan images in the respective tomographic images Ti′ and Ti+1′. The generation unit 118 performs the addition average processing of the A-scan images Aij′ and A(i+1)j′ to calculate the A-scan image Aij″ in FIG. 3C. The generation processing of a high-quality two-dimensional tomographic image (composite image) is not limited to the addition average processing. A medium value calculation processing or a weighted addition average processing may be used. All of the A-scan images (Ail′ to Aim′) of the reference image Ti′ are subjected to these processing. In the weighted addition average processing, for example, the aforementioned similarity is used as weight of addition.

Although the present exemplary embodiment describes the case where an area where a similarity is calculated is taken as Rij′ and the superposition processing is performed on Aij′ (in units of the A-scan image), the present invention is not limited to the above exemplary embodiment. The superposition processing may be performed in units of the area where a similarity is calculated, for example. Alternatively, the superposition processing may be performed on two-dimensional tomographic images. Furthermore, the superposition processing may be performed such that a similarity is calculated in units of the A-scan image with .alpha. as 0 (Rij′=Aij′ at .alpha.=0).

In step S205, the display control unit 114 displays the high-quality two-dimensional tomographic image generated in step S204 on a display unit (not illustrated). Although the above example describes that almost the same retinal area is scanned with the area fixed where the measuring beam of the tomographic imaging apparatus 120 is used for scanning, as described above, the present invention is applicable even if the entire retina is sequentially scanned.

As is understood from the above description, in the present exemplary embodiment, a similarity between the areas in a plurality of the aligned two-dimensional tomographic images is calculated using the corresponding A-scan images and a peripheral area and the area used for the superposition processing is determined in units of area. This allows a high-quality two-dimensional tomographic image to be generated because the area used for the superposition processing is determined in units of a partial area even in the case where the entire images are aligned but the retinal layer in the two-dimensional tomographic image is deformed due to involuntary eye movement.

According to the present exemplary embodiment, a high-quality two-dimensional tomographic image can be acquired even in the case where the two-dimensional tomographic image is deformed due to involuntary eye movement. The high-quality image refers to an image whose S/N ratio is made better than an image captured once. Alternatively, the high-quality image refers to an image in which the amount of information required for diagnosis is increased.

In the first exemplary embodiment, a similarity between the areas in the aligned two-dimensional tomographic images is calculated using the corresponding A-scan images and a peripheral area and the area used for the superposition processing is determined in units of area. In a second exemplary embodiment of the present invention, an area where the evaluation value is high in a neighbor area is searched in the aligned two-dimensional tomographic images using the A-scan images and a peripheral area to perform the superposition processing. According to the second exemplary embodiment, a general alignment is performed using a general characteristic, and alignment is performed using a local characteristic. Thereafter, the superposition processing is performed.

FIG. 6 is a block diagram illustrating a configuration of an image processing system 600 including an image processing apparatus 610 according to the second exemplary embodiment. As illustrated in FIG. 6, the image processing apparatus 610 includes the acquisition unit 111, the storage unit 112, an image processing unit 613, and the display control unit 114. The units excluding the image processing unit 613 have functions similar to those in the first exemplary embodiment, so that the description thereof is omitted herein.

In the image processing unit 613, a second alignment unit 619 performs a local alignment using the A-scan images and a peripheral area.

The processing by the image processing apparatus 610 according to the present exemplary embodiment is described below with reference to FIGS. 7A and 7B and FIGS. 8A to 8C. The steps other than step S704 are similar to steps S201 to S203 and step S205 in the first exemplary embodiment, so that the description thereof is omitted herein.

In step S704, the image processing unit 613 performs a general alignment and a local alignment to perform image superposition processing, thus generating a high-quality two-dimensional tomographic image. The generation processing of the high-quality two-dimensional tomographic image is described below with reference to FIG. 7B.

In step S710, the first alignment unit 115 aligns two-dimensional tomographic images with each other. The processing is similar to step S210 of the first exemplary embodiment, so that the description thereof is omitted herein.

In step S711, the two-dimensional tomographic images generally aligned in step S710 are aligned in units of the corresponding A-scan image to perform a local alignment. The processing is described below with reference to FIG. 8A. Ti′ and Ti+1′ represent images in which the same cross section is captured at different points of time, which are the two-dimensional tomographic images aligned in step S701. Aij′ represents an A-scan image in the two-dimensional tomographic image Ti′. The shaded area Rij″ represents a range of +−.alpha. in the direction of the x axis with the A-scan image Aij′ as a center. R(i+1)j″ represents a rectangular area corresponding to the range Rij″ in the two-dimensional tomographic image Ti+1′. S(i+1)j″ represents a search range for moving the rectangular area R(i+1)j″. If the reference A-scan image is taken as Aij′ and an area where a similarity is calculated is taken as Rij″, the rectangular area R(i+1)j″ in the search range S(i+1)j′ is scanned, and the calculation unit 116 calculates an evaluation value.

The calculation unit 116 calculates the correlation of pixel values between the areas Rij″ and R(i+1)j″, and the determination unit 117 evaluates the evaluation value. Alternatively, the boundary of a retinal layer is detected to detect the thickness of the layer. The calculation unit 116 calculates the evaluation value of similarity using the thickness of the layer. FIG. 8B illustrates how the similarity is evaluated using the thickness of the layer. FIG. 8B illustrates the boundary of a retinal layer (an internal limiting membrane L1 and a stratum pigmenti retinae L2) in the search range S(i+1)j″ and the thickness values 1 to 3 of the layer. The thickness of the layer on each A-scan image is acquired in the rectangular area R(i+1)j″. The calculation unit 116 compares the thickness of the layers in the areas Rij″ and R(i+1)j″ with each other to calculate the evaluation value. The layer used for calculating the thickness of the layer is not limited to the above-mentioned layer. Other boundaries of a layer such as a nerve fiber layer or a visual cell inner-outer segment junction may be used to compare the thickness of the layer.

Although the superposition determination processing performed by the determination unit 117 is omitted in the present exemplary embodiment, it may be determined whether superposition is performed based on similarity after alignment is performed in units of the A-scan image as is the case with the first exemplary embodiment.

In step S712, the generation unit 118 performs processing for superposing the corresponding A-scan images with each other. More specifically, the generation unit 118 performs superposition processing between the A-scan image positioned at the center of the rectangular area R(i+1)j″ where the evaluation value is maximized and the reference A-scan image Aij′.

In the present exemplary embodiment, the area Rij″ is set as a rectangular area of the range of +−.alpha. with the position of the A-scan image as a center. As a method for setting the area Rij″, .alpha. may not be fixed, but may be dynamically changed according to an image feature of the retinal layer captured in the two-dimensional tomographic image. FIG. 8C illustrates an example of setting the area. For example, in a case where the retinal layer is planar, the range of .alpha. is set wide (R(i+1)j″). In a case where the retinal layer is curved and a characteristic place exists in the two-dimensional tomographic image (a large number of vertical and horizontal edges exist), the range of .alpha. is set narrow (R(i+1)k″). The range of an area may be changed for each case according to the image feature or may be changed in units of the A-scan image in one two-dimensional tomographic image.

As is apparent from the above description, in the present exemplary embodiment, an area where an evaluation value is high in a neighbor area is searched in the aligned two-dimensional tomographic images using the A-scan images and a peripheral area to perform the superposition processing. This allows a high-quality two-dimensional tomographic image to be generated because the local alignment is performed even in the case where the entire images are aligned but the retinal layer in the two-dimensional tomographic image is deformed due to involuntary eye movement.

The foregoing exemplary embodiments realize the present invention as an image processing apparatus. The exemplary embodiments of the present invention, however, are not limited only to the image processing apparatus. The present invention can be realized as software operating on a computer. A central processing unit (CPU) of the image processing apparatus controls the entire computer using computer programs and data stored in a RAM and a ROM. The CPU also controls the execution of software programs corresponding to the units of the image processing apparatus to realize the functions of the units.

While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all modifications, equivalent structures, and functions. 

The invention claimed is:
 1. An image processing system including at least one processor comprising: a calculation unit configured to calculate a degree of similarity among a plurality of tomographic images acquired based on a measuring beam controlled so that a same position is scanned with the measuring beam a plurality of times; a generation unit configured to generate a composite image by combining a plurality of tomographic images having a calculated degree of similarity being larger than a predetermined value; and an acquisition unit configured to acquire the plurality of tomographic images, wherein the acquisition unit is a Fourier-domain optical coherence tomography apparatus.
 2. The image processing system according to claim 1, wherein the calculation unit is configured to calculate a degree of similarity between a reference tomographic image and a tomographic image different from the reference tomographic image, and further comprising a determination unit configured to determine a tomographic image to be used by the generation unit for generating a composite image from the plurality of tomographic images based on the calculated degree of similarity.
 3. The image processing system according to claim 1, further comprising: an alignment unit configured to align the plurality of tomographic images with one another, wherein the calculation unit is configured to calculate the degree of similarity among the plurality of tomographic images after the plurality of tomographic images are aligned by the alignment unit.
 4. The image processing system according to claim 3, wherein the alignment unit is configured to align the plurality of tomographic images with one another based on boundaries between layers of the plurality of tomographic images.
 5. The image processing system according to claim 1, wherein the tomographic images are tomographic images of a subject's eye.
 6. The image processing system according to claim 5, wherein the tomographic images of the subject's eye are tomographic images of a fundus of the subject's eye.
 7. The image processing system according to claim 1, wherein the generation unit is configured to not combine tomographic images having a degree of similarity being smaller than the predetermined value.
 8. The image processing system including at least one processor comprising: a calculation unit configured to calculate a degree of similarity of a plurality of tomographic images to a first tomographic image; a generation unit configured to generate a composite image by combining a plurality of tomographic images having a calculated degree of similarity being larger than a predetermined value with the first tomographic image; and an acquisition unit configured to acquire the first tomographic image and the plurality of tomographic images, wherein the acquisition unit is a Fourier-domain optical coherence tomography apparatus, and wherein the first tomographic image and the plurality of tomographic images are acquired based on a measuring beam controlled so that a same position is scanned with the measuring beam a plurality of times.
 9. The image processing system according to claim 8, further comprising: an alignment unit configured to align the first tomographic image and the plurality of tomographic images, wherein the calculation unit is configured to calculate the degree of similarity of the plurality of tomographic images to the first tomographic image after the first tomographic image and the plurality of tomographic images are aligned by the alignment unit.
 10. The image processing system according to claim 9, wherein the alignment unit is configured to align the first tomographic image and the plurality of tomographic images based on boundaries between layers of the first tomographic image and the plurality of tomographic images.
 11. The image processing system according to claim 8, wherein the first tomographic image and the tomographic images are tomographic images of a subject's eye.
 12. The image processing system according to claim 11, wherein the tomographic images of the subject's eye are tomographic images of a fundus of the subject's eye.
 13. An image processing method executed by at least one processor, comprising: acquiring a plurality of tomographic images via an acquisition unit, the plurality of tomographic images being acquired based on a measuring beam controlled so that a same position is scanned with the measuring beam a plurality of times, wherein the acquisition unit is a Fourier-domain optical coherence tomography apparatus; calculating a degree of similarity among the plurality of tomographic images; and generating a composite image by combining a plurality of tomographic images having a calculated degree of similarity being larger than a predetermined value.
 14. An image processing apparatus including at least one processor comprising; an acquisition unit configured to acquire a plurality of two-dimensional tomographic images acquired based on a measuring beam controlled so that a same position is scanned with the measuring beam a plurality of times; and a combining unit configured to combine some of the two-dimensional tomographic images, wherein the combining unit is configured to combine some of the two-dimensional tomographic images based on a degree of similarity among the plurality of two-dimensional tomographic images, and wherein the two-dimensional tomographic images are two-dimensional tomographic images of a subject's eye.
 15. The image processing apparatus according to claim 14, wherein the acquisition unit is a Fourier-domain optical coherence tomography.
 16. The image processing apparatus according to claim 14, wherein the two-dimensional tomographic images of the subject's eye are two-dimensional tomographic images of a fundus of the subject's eye.
 17. An image processing apparatus including at least one processor comprising; an acquisition unit configured to acquire a plurality of two-dimensional tomographic images of a subject's eye acquired based on a measuring beam controlled so that a same position is scanned with the measuring beam a plurality of times; a selection unit configured to select some of the two-dimensional tomographic images; and a combining unit configured to combine the two-dimensional tomographic images selected by the selection unit, wherein the selection unit is configured to select some of the two-dimensional tomographic images based on a degree of similarity among the plurality of two-dimensional tomographic images. 